On the use of GSV-SVM for Speaker Diarization and Tracking
نویسندگان
چکیده
In this paper, we present the use of Gaussian Supervectors with Support Vector Machines classifiers (GSV-SVM) in an acoustic speaker diarization and a speaker tracking system, compared with a standard Gaussian Mixture Model system based on adapted Universal Background Models (GMM-UBM). GSVSVM systems (which share the adaptation step with the GMMUBM systems) are observed to have comparable performances: for acoustic speaker diarization, the GMM-UBM system outperforms the GSV-SVM system on ESTER2 data but the latter system works better in the speaker tracking system. In particular, the linear combination of two systems at the score level outperforms each individual system.
منابع مشابه
Thinkit Audio Genre Classification System for Mirex08
This full abstract describes our submitted system for the MIREX08 Audio Genre Classification task, the goal of which is to discriminate music excerpts of different genres/styles. The system is based on basic feature of MFCC and modeling framework of GSV-SVM, which has been successfully applied in speaker recognition field. In this submission, the only basic feature we use is MFCC. And the goal ...
متن کاملExperiments on speaker tracking and segmentation in radio broadcast news
In this paper we describe the speaker tracking and clustering system that we implemented for the ESTER evaluation campaign. We present some experiments on normalization in speaker tracking, in particular concerning the use of t-norm for speaker tracking in broadcast news. Results show that the use of t-norm significantly improves the performance at low false alarm rates. In a second part of the...
متن کاملModeling Overlapping Speech using Vector Taylor Series
Current speaker diarization systems typically fail to successfully assign multiple speakers speaking simultaneously. According to previous studies, overlapping errors account for a large proportion of the total errors in multi-party speech diarization. In this work, we propose a new approach using Vector Taylor Series (VTS) to obtain overlapping speech models assuming individual speaker models ...
متن کاملDevelopment of a Speaker Diarization System for Speaker Tracking in Audio Broadcast News: a Case Study
A system for speaker tracking in broadcast-news audio data is presented and the impacts of the main components of the system to the overall speaker-tracking performance are evaluated. The process of speaker tracking in continuous audio streams involves several processing tasks and is therefore treated as a multistage process. The main building blocks of such system include the components for au...
متن کاملVariability compensated support vector machines applied to speaker verification
Speaker verification using SVMs has proven successful, specifically using the GSV Kernel [1] with nuisance attribute projection (NAP) [2]. Also, the recent popularity and success of joint factor analysis [3] has led to promising attempts to use speaker factors directly as SVM features [4]. NAP projection and the use of speaker factors with SVMs are methods of handling variability in SVM speaker...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010